Skip to content

Use native NCCL Alltoall API when able.#136

Merged
romerojosh merged 2 commits into
mainfrom
nccl-alltoall-fastpath
Jun 11, 2026
Merged

Use native NCCL Alltoall API when able.#136
romerojosh merged 2 commits into
mainfrom
nccl-alltoall-fastpath

Conversation

@romerojosh

Copy link
Copy Markdown
Collaborator

NCCL 2.28.3 introduced a native NCCL Alltoall API. In recent NCCL releases, using this API opens up some potential performance benefits when combined with symmetric memory registration. This PR introduces usage of the NCCL Alltoall API where currently feasible without additional changes for symmetric memory benefits. Those will be made in a subsequent PR.

Signed-off-by: romerojosh <joshr@nvidia.com>
@romerojosh

Copy link
Copy Markdown
Collaborator Author

/build

@github-actions

Copy link
Copy Markdown

🚀 Build workflow triggered! View run

@github-actions

Copy link
Copy Markdown

✅ Build workflow passed! View run

Signed-off-by: Josh Romero <joshr@nvidia.com>
@romerojosh romerojosh merged commit 0dc0416 into main Jun 11, 2026
4 checks passed
@romerojosh romerojosh deleted the nccl-alltoall-fastpath branch June 11, 2026 16:37
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant